Study on the Classification of Negative Sentiment Weibo Messages in the

نویسندگان

  • H. BAI
  • G. YU
چکیده

Weibo is an extensively used social network tool in China and has become a popular platform for disaster information management. This popular microblogging service offers massive firsthand information regarding the state and emotions of victims in a disaster situation. Identifying negative sentiment messages from the large-scale and noisy Weibo stream is a fundamental and challenging undertaking. Therefore, based on the characteristics of negative Weibo messages concerning disaster events, a novel feature selection algorithm called combined frequent pattern (FP)-growth and mutual information theory (CFM) algorithm, was proposed to improve the traditional machine learning approaches in this study. The CFM algorithm mined two FPs via FP-tree, and the mutual information between two frequent items was calculated to determine the most frequent and tight features for negative-sentiment Weibo messages detection. After that, the experimental analysis was conducted to test the proposed novel feature selection algorithm and to explore a suitable sentiment classifier for disaster-related Weibo messages. The analysis employed actual disaster-related Weibo message data set, which included 2,913 negative messages and 2,913 un-negative messages. Results demonstrate that the CFM algorithm performs well in the feature selection process. In particular, this algorithm exhibits the best performance in the support vector machine classifier with 89.34% accuracy. Therefore, the CFM algorithm is an efficient feature selection algorithm for negative-message classification in a post-disaster situation. This algorithm also offers a novel method to reduce the feature dimension in other text classification areas. Subject Categories and Descriptors I.2.7 [Artificial intelligence]: Natural Language ProcessStudy on the Classification of Negative Sentiment Weibo Messages in the Post-disaster Situation H. BAI1*, G. YU1, XY. TIAN1, 2 1School of Management, Harbin Institute of Technology Heilongjiang, 150001, China 2National Institute for Mental Health Research ANU College of Medicine Biology & Environment, ACT, 2601, Australia [email protected] -ing Text analysis; H.2.8 [Database Applications]: Data mining; General Terms: Chinese text mining, sentiment analysis

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Rule-Based Weibo Messages Sentiment Polarity Classification towards Given Topics

Weibo messages sentiment polarity classification towards given topics refers to that the machine automatically classifies whether the weibo message is of positive, negative, or neutral sentiment towards the given topic. The algorithm the sentiment analysis system CUCsas adopts to perform this task includes three steps: (1) whether there is an “exp” (short for “expression having evaluation meani...

متن کامل

A High-Performance Model based on Ensembles for Twitter Sentiment Classification

Background and Objectives: Twitter Sentiment Classification is one of the most popular fields in information retrieval and text mining. Millions of people of the world intensity use social networks like Twitter. It supports users to publish tweets to tell what they are thinking about topics. There are numerous web sites built on the Internet presenting Twitter. The user can enter a sentiment ta...

متن کامل

Lexicon-Based Sentiment Analysis on Topical Chinese Microblog Messages

Microblogging is a popular social media where people express their opinions and sentiment on social topics. The Chinese microblogging service, called Weibo, has become a remarkable media in the Chinese society. People are eager to know others’ attitudes towards social events, thus sentiment analysis on those topical microblog messages is important. In this paper we introduce a lexicon-based sen...

متن کامل

Sentiment Analysis of Social Networking Data Using Categorized Dictionary

Sentiment analysis is the process of analyzing a person’s perception or belief about a particular subject matter. However, finding correct opinion or interest from multi-facet sentiment data is a tedious task. In this paper, a method to improve the sentiment accuracy by utilizing the concept of categorized dictionary for sentiment classification and analysis is proposed.  A categorized dictiona...

متن کامل

Sentiment Analysis of Chinese Microblog Message using Neural Network-based Vector Representation for Measuring Regional Prejudice

Regional prejudice is prevalent in Chinese cities in which native residents and migrants lack a basic level of trust in the other group. Like Twitter, Sina Weibo is a social media platform where people actively engage in discussions on various social issues. Thus, it provides a good data source for measuring individuals’ regional prejudice on a large scale. We find that a resentful tone dominat...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016